DETAILED ACTION 
Specification 

The amendment filed 11/2/2006 is objected to under 35 U.S.C. 132(a) because it 
introduces new matter into the disclosure. 35 U.S.C. 132(a) states that no amendment shall 
introduce new matter into the disclosure of the invention. The added material which is not 
supported by the original disclosure is as follows: SEQ ID NOs 14-16. There is no support for 
the entirety of these sequences in the specification as filed (SEQ ID NOs 14 and 15 are 10 Kb, 
and SEQ ID NO 16 is 1.1 Kb). If they are the GenBank Accession Nos. referenced on pages 23 
and 24 of the specification, in the context of the specification the Accession Nos. are used only 
to indicate where within the Accession Nos. the specific primers in the PCR anneal; it is not even 
disclosed that the Accession Nos. were the actual PCR template (e.g. page 23, lines 19-26). 
Hence, the only support found for any sequences from these Accession Nos. is the inherent 
support that the PCR primers used in the specification anneal at specific regions in the Accession 
Nos. This does not provide support for entire Accession Nos., which, at best, appear to be what 
is contained in the Sequence Listing. 

Applicant is required to cancel the new matter in the reply to this Office Action. 

Claim Rejections - 35 USC § 112 
The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 



Application/Control Number: 10/777,010 Page 3 

Art Unit: 1633 

Claims 50-52 and 59-61 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

Claims 50-52 and 59-61 recite tRNA sequences comprising sequences found between 
specific base pairs of Genbank Accession Nos. The specification provides no guidance on 
which, if any, of the SEQ ID NOs in the Sequence Listing corresponds to the Genbank 
Accession Nos. recited in the claims. Furthermore, because the content of Genbank entries can 
(and does) change over time due to corrections, it cannot be determined if the sequences 
represented by the SEQ ID NOs (if they are the recited Genbank Nos.) are the actual DNA 
sequences as they existed at the time of filing of the instant application (applicants claim priority 
to 1999). Therefore, it cannot be determined what the actual tRNA sequences must be in order 
to anticipate the claims, preventing a satisfactory search of the claims. Thus, the metes and 
bounds of the claimed subject matter are unclear, and a search of the prior art regarding these 
specific sequences cannot be performed. 



The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

Claims 50-52 and 59-61 are rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter which 
was not described in the specification in such a way as to reasonably convey to one skilled in the 
relevant art that the inventor(s), at the time the application was filed, had possession of the 
claimed invention. 

The recitation of tRNA sequences comprising sequences of Genbank Accession Nos. in 
claims 50-52 and 59-61 is an attempt to incorporate essential, claimed subject matter by 
reference to prior art documents (i.e. the Genbank entries). A review of the disclosure as filed 
does not reveal these DNA sequences, thus there is no evidence applicants had possession of 
these specific DNA sequences. The incorporation of essential material in the specification by 
reference to an unpublished U.S. application, foreign application or patent, or to a publication is 
improper. See MPEP § 608.01(p)(I). Applicant is required to amend the disclosure to include 
the material incorporated by reference, if the material is relied upon to overcome any objection, 
rejection, or other requirement imposed by the Office. The amendment must be accompanied by 
a statement executed by the applicant, or a practitioner representing the applicant, stating that the 
material being inserted is the material previously incorporated by reference and that the 
amendment contains no new matter. 37 CFR 1.57(f). 

The attempt to incorporate subject matter into this application by reference to the 
Genbank Accession Nos. recited in claims 50-52 and 59-61 is ineffective because the Accession 
Nos. are non-patent prior art documents (i.e. not a U.S. Patent or published U.S. application). 
The incorporation by reference will not be effective until correction is made to comply with 37 
CFR 1.57(b), (c), or (d). If the incorporated material is relied upon to meet any outstanding 
objection, rejection, or other requirement imposed by the Office, the correction must be made 
within any time period set by the Office for responding to the objection, rejection, or other 
requirement for the incorporation to be effective. Compliance will not be held in abeyance with 
respect to responding to the objection, rejection, or other requirement for the incorporation to be 
effective. In no case may the correction be made later than the close of prosecution as defined in 
37 CFR 1.114(b), or abandonment of the application, whichever occurs earlier. 
Any correction inserting material by amendment that was previously incorporated by reference 
must be accompanied by a statement that the material being inserted is the material incorporated 
by reference and the amendment contains no new matter. 37 CFR 1.57(f). 



Claim Rejections - 35 USC § 103 
The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

Claims 45-48, 53-56, and 58 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Del Tito et al (cited in the IDS of 2/11/2004) in view of Nakamura et al, Zhang et al, Saier, 
Kawakami et al, Clouthier et al (see applicants' exhibits A-G in the response filed 9/26/2002 in 
parent application 09/492,590) and Sprinzl et al (Nuc. Acids Res., 1998). 

Del Tito et al teach the construction and use of a plasmid, pRI952, which comprises an 
array of two tRNA genes (argU and ileX) encoding tRNAs specific for the rarely used codons 
AGG/AGA and AUA, respectively (e.g. page 7087, paragraph 2; Tables I and II). The authors 
teach that pRI952 was constructed by insertion of a PCR-amplified DNA comprising the gene 
(comprising the endogenous tRNA promoter) for ileX flanked by HindIII restriction sites into 
pDC592, a pACYC184 derivative (i.e. low copy number plasmid comprising the tet promoter 
and chloramphenicol resistance gene, see description and diagram of pACYC184, GenBank 
accession number X06403) already possessing the argU gene (e.g. page 7087, column 2, 
paragraph 2). Del Tito et al teach that coexpression of the two tRNA genes along with the gene 
encoding the heterologous polypeptide Mup r IRS results in increased levels of active protein as 
compared to a control in which no additional tRNA genes are expressed or as compared to cells 
comprising a plasmid only expressing the ileX gene (e.g. Table II). Del Tito et al teach that 
"...problems in expression can be avoided by a careful inspection of the coding sequence and 
inclusion of appropriate tRNA genes or necessary site-specific mutations." (page 7087, column 
1, paragraph 2). The authors conclude that the co-expression of minor tRNAs such as ileX or 
argU can be utilized to overcome translational stresses due to the presence of rarely used codons 
within the coding sequence for a gene of interest (e.g. page 7091, column 1, paragraph 3). Del 
Tito et al teach the purification by reverse-phase HPLC of another heterologous polypeptide (i.e. 
the B/LeeHA antigen) produced by their system for compensating for the presence of rare 
codons in the coding sequence for the desired polypeptide (e.g. page 7088, column 1, paragraphs 
3-4). 

Del Tito et al do not explicitly teach the use of a vector comprising an array of the three 
tRNAs argU, ileY, and leuW. 

Nakamura et al (Nucleic Acids Research, 1996, Vol. 24, pages 214-215; see the entire 
reference) provide codon usage data tabulated from the GenBank international DNA sequence 
databases for 4,805 species (e.g. prokaryotes, protozoa, fungi, animals and plants). 

Zhang et al (Gene, 1991, pages 61-72, see the entire reference) detail low usage codons in 
species as diverse as E. coli, yeast, Drosophila and primates. 

Saier, M. H. (FEBS, 1995, Vol. 362, pages 1-4; see the entire document) teaches the rare 
codon usage in several different species (e.g. R. capsulatus, R. sphaeroides, C. acetobutylicum, S. 
coelicolor and E. coli). 

Sprinzl et al (Nucleic Acids Research, 1998, Vol. 26, pages 148-153; see the entire 
document) teach a compilation of 3,279 sequences of tRNA genes (including the E. coli argU, 
ileY, and leuW genes) including cellular and mitochondrial tRNAs from bacteria and phage, 
plants, yeasts and fungi, insects, amphibians and mammals, including rats, mice, cows and 
humans. 

Kawakami et al (1993, Genetics, Vol. 135, pages 309-320; see the entire document) teach 
a rare Arg-tRNA-CCU in S. cerevisiae. 

Clouthier et al (J. Bacteriology, 1998, Vol. 180, pages 840-845, see the entire document) 
teach a rare Arg-tRNA-AGA from S. enteritidis. 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify the vector construct taught by Del Tito et al for compensating for the presence of 
rarely used codons present in the gene encoding a protein of interest by interchanging and/or 
adding different tRNA genes corresponding to other rarely used codons in a given cell type, 
because Del Tito et al teach that it is within the skill of the art to carefully scrutinize the coding 
sequence of a protein, identify rarely used codons and compensate for the presence of such rarely 
used codons by supplying in trans the tRNA corresponding to the identified rarely used codons from a 
vector expressing different tRNA genes, and because the rarely used codons and corresponding 
genes were widely known in the art (i.e. the teachings of Nakamura et al, Zhang et al, Saier, 
Sprinzl et al, Kawakami et al and Clouthier et al). One would have been motivated to do so in 
order to meet the particular rare-codon requirements of a gene encoding a desired protein in 
combination with a given cell type, and thus receive the expected benefit of increasing its 
expression in the given cell type, as taught by Del Tito et al. Absent any evidence to the 
contrary, there would have been a reasonable expectation of success in utilizing any tRNA gene 
obtained from any cell type that was known in the art (i.e. ileY, proL, leuW, etc.) in the approach 
taught by Del Tito et al to increase the production of a desired protein that comprises rarely used 
codons. 

Regarding the limitations of instant claims 46, 47, 55, and 56, these are matters of design 
choice in the preparation of the vector. Del Tito et al teach the insertion of the ileX gene, 
flanked by HindIII sites, into the pDC592 vector. Thus, the gene could have been inserted in 
either orientation relative to the argU gene. Such choices are dependent upon the method steps 
used to construct the vector, the sequence of the tRNA gene chosen, and the location of 
convenient restriction sites in the chosen vector. Applicants' choice of an order of argU, ileY, 
and leuW appears arbitrary from a reading of the specification and imparts no discernable 
advantage to the vector. The same is true for the orientation of the ileY gene relative to the other 
tRNAs. Given that the basic concept that is the crucial element of the invention was already 
known in the art (i.e. providing tRNAs corresponding to rarely used codons from DNA 
constructs comprising the cognate tRNA genes), that applicants' invention differs only in the 
make-up of the DNA constructs that are used to express the tRNA genes, and that the means and 
materials for making the changes necessary to the constructs taught by Del Tito et al in order to 
arrive at the claimed invention were available, it is the examiner's conclusion that there is no significant 
contribution from the instant application that was not already readily available from the prior art. 

Claims 49 and 57 are rejected under 35 U.S.C. 103(a) as being unpatentable over Del 
Tito et al, Nakamura et al, Zhang et al, Saier, Kawakami et al, Clouthier et al, and Sprinzl et al as 
applied to claims 45-48, 53-56, and 58 above, and further in view of Skerra et al (U.S. 5,849,576, 
1998). 

The teachings of Del Tito et al, Nakamura et al, Zhang et al, Saier, Kawakami et al, 
Clouthier et al, and Sprinzl et al are as above and applied as before, except: Del Tito et al teach 
that the expression of tRNA genes has been shown to be deleterious to the host cell and that for 
this reason the ileX promoter was used to control expression of the ileX gene from low-copy 
number plasmids (page 7090, column 2, paragraph 3). 

None of the above references teach the use of the tet promoter operably linked to the 
tRNA genes. 

Skerra et al teach the use of the tet promoter to control expression of toxic genes in E. 
coli (see entire document, particularly the abstract and column 2, lines 49-55). 

It would have been obvious to one of ordinary skill in the art at the time of the invention 
to use the tet promoter (and the regulable tetracycline expression system) described by Skerra et 
al with the vectors of Del Tito because Del Tito et al teach that the expression of tRNA genes in 
E. coli can have a negative effect on the host cell. Skerra et al teach that a system for the tightly 
controlled expression of target, toxic genes in E. coli was well known and widely used within the 
art for the expression of toxic genes in E. coli. One would have been motivated to do so in order 
to receive the expected benefit of avoiding any potential toxic effects associated with the 
expression of the tRNA genes in E. coli during periods when expression of said tRNA genes was 
not required. Absent any evidence to the contrary, there would have been a reasonable 
expectation of success in utilizing the tet promoter expression system for the controlled 
expression of tRNA genes in E. coli for the purpose of expression of desired polypeptides whose 
coding sequence comprises a number of different, rarely used codons. 

Claim Rejections - 35 USC § 101 

35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or 
any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and 
requirements of this title. 

Claims 54 and 58 are rejected under 35 U.S.C. 101 because the claimed invention is 
directed to non-statutory subject matter. The nucleic acid is not recited as purified or isolated, 
and thus reads on the E. coli genome, a product of nature. According to the specification and 
claims, the E. coli genome contains all three of the tRNA genes specified in the claim. 

Conclusion 

No claims are allowed. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael D. Burkhart whose telephone number is (571) 272-2915. 
The examiner can normally be reached on M-F 8AM-5PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Joseph Woitach, can be reached on (571) 272-0739. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
Examiner 

pACYC184: description & restriction map 



Related Documents: 

GenBank/EMBL accession number X06403. 

Restriction sites 
Sequence 

Not available from Fermentas 



The 4245 bp plasmid, pACYC184, is compatible with pMB1- or ColE1-related plasmids 
and can therefore be used together with a pMB1- or ColE1-derivative within the same cell. 
pACYC184 contains: (1) the replicon rep responsible for the replication of plasmid (source 
- plasmid p15A); (2) tet gene, encoding tetracycline resistance protein (source - plasmid 
pSC101); (3) cat gene, coding for chloramphenicol acetyl transferase that confers resistance 
to chloramphenicol (source - transposon Tn9). 

The circular sequence is numbered such that 1 is the first G of the unique EcoRI site 
GAATTC and the count increases first through the 5'-terminal part of cat gene, the p15A 
material, the tet gene and finally through the 3'-terminal part of cat gene. The map shows 
enzymes that cut pACYC184 DNA once. Enzymes produced by Fermentas are shown in 
blue. The coordinates refer to the position of first nucleotide in each recognition sequence. 

The exact position of genetic elements is shown on the map (termination codons included). 
The indicated rep region is sufficient to promote replication. DNA replication initiates at 
position 846 (+/- 1) and proceeds in the direction indicated. 

Derivatives of p15A are normally present in a somewhat lower copy number than pMB1 
derivatives and cannot be amplified to the same extent as pMB1 derivatives. Due to the 
presence of the cat gene on pACYC184, amplification of this plasmid should be performed 
using spectinomycin. 

cloning vehicles derived from the P15A cryptic miniplasmid, J. Bacteriol., 134, 1141-1156, 1978. 

Enzymes which cut pACYC184 DNA once: 

AasI 3696, BamHI 1870, BclI 3542, BfmI 1633, BseJI 3442, BseMI 4234, BseSI 3779, 

ABSTRACT 

Sequences of 3279 sequences of tRNA genes and tRNAs 
published up to December 1996 are included in the 
compilation. Alignment of the sequences, which is most 
compatible with the tRNA phylogeny and known three- 
dimensional structures of tRNA, is used. Sequences and 
references are available under 
http://www.uni-bayreuth.de/departments/biochemie/trna/ 



INTRODUCTION 

The 1997 compilation contains 3279 sequences of tRNAs and 
tRNA genes. The last edition which appeared two years ago (1) 
was supplemented by 579 new sequences covering the literature 
up to December 1995. The sequences of tRNA mutants and of 
tRNAs originating from transformed or differentiated cells were 
not considered. 

The tRNAs included in the compilation are listed in Table 1. 
Each tRNA or tRNA gene is specified by the (abbreviated) name 
of the organism from which it was isolated and a four digit code: 
the first three digits identify the organism, the last digit specifies 
the particular isoacceptor. The amino acid specificity of the tRNA 
is indicated by a one-letter amino acid code. The tRNAs coding 
for selenocysteine were annotated with the letter Z. Initiator 
tRNAs are annotated with the letter X. 
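
The identifier scheme described above can be illustrated with a minimal Python sketch. It assumes the four-digit code is handled as a plain string such as "1660"; the names TrnaCode, parse_trna_code and describe_specificity are hypothetical and are not part of the compilation or its software. The organism code 166 is the value listed for E. coli in Table 1.

```python
from dataclasses import dataclass

# Special one-letter annotations named in the text; all other letters are
# ordinary one-letter amino acid codes.
SPECIAL_LETTERS = {"Z": "selenocysteine", "X": "initiator"}


@dataclass(frozen=True)
class TrnaCode:
    organism: str     # first three digits: identify the organism
    isoacceptor: str  # last digit: specifies the particular isoacceptor


def parse_trna_code(code: str) -> TrnaCode:
    """Split a four-digit code into its organism and isoacceptor parts."""
    if len(code) != 4 or not (code.isascii() and code.isdigit()):
        raise ValueError(f"expected a four-digit code, got {code!r}")
    return TrnaCode(organism=code[:3], isoacceptor=code[3])


def describe_specificity(letter: str) -> str:
    """Describe a one-letter specificity annotation."""
    return SPECIAL_LETTERS.get(letter, f"amino acid {letter}")


if __name__ == "__main__":
    print(parse_trna_code("1660"))
    print(describe_specificity("Z"))
```

Keeping the code as a string rather than an integer preserves leading zeros, which matter for the low-numbered organism groups (for example 020 for a phage entry).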

The references are restricted to the first publication of the 
complete sequence unless additional information (e.g., base 
modification, corrections, etc.) was later obtained. In such cases 
additional references were added. 

In order to facilitate a computer analysis an alignment is used 
which is most compatible with the tRNA phylogeny and known 
three-dimensional structures of tRNA. The corresponding 
numbering system is shown in Figure 1. 

As was the case in the previous edition (1), this publication does 
not contain a sequence printout. Instead, the sequences, references 
and footnotes of tRNAs and tRNA genes listed in Table 1 
are deposited in the European Bioinformatics Institute (EBI) Data 
Library. In addition, a World Wide Web page has been established 
and is available under http://www.uni-bayreuth.de/departments/biochemie/trna/. 
The present publication should be quoted as a 
reference for the electronically accessible data. 




Figure 1. Numbering of nucleotides in tRNAs. Circles represent nucleotides 
which are always present; the ovals, nucleotides which are not present in each 
structure: these are nucleotides before the position 1 on the 5'-end, before and 
after the two invariant GMP residues 18 and 19 in the D-loop, and the 
nucleotides in the variable loop. The nucleotide to be added at a given site is 
indicated by the number of the preceding nucleotide followed by a colon and 
a letter in alphabetical order. The nucleotides in the variable stem have the prefix 
'e' and are located between position 45 and 46 obeying the base-pairing rules. 
The nucleotides in the 5'-strand and the 3'-strand are numbered by e11, e12, 
e13, ... and e21, e22, e23, respectively; the second digit identifies the 
base-pair. In the case of a long variable region, the loop can be formed by up 
to 5 nt: e1, e2, e3, e4 and e5. Positions in which invariant nucleotides usually 
occur are indicated by a thick line. 

Researchers who wish to perform an advanced search for tRNA 
sequences according to several criteria, e.g., anticodon, amino 
acid specificity, modified nucleoside, or wish to print the 
requested sequence in the form of Table 2 or cloverleaf format 
(Fig. 1) can obtain appropriate software on diskette. Please 
contact M. Sprinzl, Laboratorium für Biochemie, Universität 
Bayreuth, D-95440 Bayreuth, Germany, Fax: +49 921 552432, 
Email: Mathias.Sprinzl@uni-bayreuth.de. 



*To whom correspondence should be addressed. Tel: +49 921 552 420; Fax: +49 921 552 432; Email: mathias.sprinzl@uni-bayreuth.de 

Table 1. List of tRNA sequences and sequences of tRNA genes included in the compilation 

PART ONE: Sequences of tRNA genes 

Source                      Code      tRNA genes 

VIRUSES                     000-029 
MYCOBACTERIOPH. L5          020       NOW 
PHAGE PHI a 1               031 
PHAGE T4                    022       CTLPQRST 
PHAGE T5                    026       ACTJEWmiRT>MNPQSSTVWXY 

ARCHAEBACTERIA              030-109 
ARCHAEOGLOBUS FULG.         034       A 
HALOBACTERIUM cut.          038       AC 
HALOBACTERIUM HAL.          042       A 
HALOBACTERIUM MAR.          044       LS 
HALOBACTERIUM MED.          046       W 
HALOFERAX VOLCANII          050       CW 
METHANOBAC. FORMI.          058       A 
METHANOBAC.THERM.           062       A 
METHANOCOCCUS JAN.          065       AACDEFGOTIUXKMMNQPPRRRSSSriVVVWXY 
METHANOCOC. VANI.           066       ADEITOKLNPQRTTVY 
METHANOTHRIX SOEH.          067       A 
METHANOTHERM. FER.          068       ADKHIKLMNPST 
RUMINOBACTER amylo          070       E 
methanococ. voltae          074       DKPTY 
METHANOPYRUS KAND.          076       KLQS 
METHANOSPIR. HUNG.          078       A 
SULFOLOBUS solfa            086       FGLSVX 
THERMOPLASMA acid.          090       M 
THERMOCOCCUS CELER          094       APT 
THERMOFIL. PENDENS          096       GM 
THERMOPROT. TENAX           098       AALLX 

EUBACTERIA                  110-239 
BARTONELLA BACIL            110       IX 
BARTONELLA ELIZAB.          111       AI 
BARTONELLA HENSELA          112       I 
BARTONELLA QUINT.           113       AI 
MYCOPLASMA CAPRIC           114       ACDBTGHIlKKLIi^PQRRSSTTVWWXY 
MYCOPLASMA GEN.             115       ACOEFOGHIIKKU^MNPQRRSSSSTrTWWY 
MYCOPLASMA MYCOID.          118       AD EFOIMN PRRSTVX 
MYCOPLASMA PNEU.            120       ACDBEGGHlIKRTJLMNPQRIUASSSSrrmAVXYY 
MYCOPLASMA PG50             122       KL 
ACHOLEPLASMA LAID.          123       AACJJFJFC1HI1KK3XLMMNQRSSTVW 
SPIROPLASMA CITRI           125       SWW 
SPIROPLASMA MELIF.          126       ACDFIMPRSX 
BORRELIA BURGDORF.          128       AI 
STREPTOMYCES GRIS.          130       S 
STREPTOMYCES COEL.          131       L 
STREPTOMYCES RIM            134       EQQXX 
STREPTOMYCES LIV.           135       CDEEEGGKNNQQRSWY 
STREPTOMYCES AMBO.          136       P 
CLOSTRIDIUM PERFR.          139       S 
MYCOBACT. TUBERC.           140       PV 
KLEBSIELLA AEROGE           141       N 
AGROBACTER.TUME.            142       R 
CLOSTRIDIUM THERM.          143       Z 
DESULFOMICR. BACU.          144       Z 
CLOSTRIDIUM ACETO.          145       T 
PLESIOMONAS SHIGE.          146       B 
ENTEROCOCCUS HIRAE          147       A 
STAPHYLOCOC.AURE.           148       ACDDFGGOGHlKliXMPQRSSTTVVWXY 
LACTOBAC. BULG.             150       DEGNPRSV 
LACTOBAC.DELBRUEC.          152       S 
LACTOCOCCUS LACTIS          153       AAAEFGiNSX 
BACILLUS SUBTILIS           154       AAAACDEFFGGGH H UIKX1 ,1 ,1 1 .1 .1 .MMNNPQRSSSTTTVWXXY 
BACILLUS CIRCULANS          156       P 
BACILLUS SP. PS3            157       DENSV 
THERMUS THERMOPHI           158       GGTTY 
THERMOTOGA MARIT.           159       MMTWY 
RHODOTHERMUS MAR.           160       AI 
thiobacillus ferro          162       AI 
stigmatella aurant.         163       GTTY 
E. COLI                     166       AACDFJGGGHinOlJJJJrfNPPPQQRRRRRSSSSTITTTVVVWXXYYZ 
SALMONELLA TYPHI.           170       HLPRR 
AZOSPIRILLUM LIPO.          172       KTV 
TRICHODESMIUM SPEC          173       AI 
PHOTOBACT. PHOSPH.          174       HP 
PHOTOBAC. LEIOGNA           175       LM 
AEROMONAS hydroph           178       AEH3LPR 
PSEUDOMONAS AER.            in        AGTTTY 
PSEUDOMONAS FLUOR.          184       AI 
CAMPYLOBAC. JEJUNI          186       AI 
RICKETTSIA PROW.            187       GWY 
CAULOBACTER CRES.           189       AI 
BRUCELLA SUIS               190       AI 
BP^iry|j r ^ mto jjtkmr     191       AI 
BRUCELLA ABORTUS            192       AAA 
AZORHIZOBIUM CAUL.          193       O 
RHIZOBIUM MELILOTI          194       L 
AZOARCUS SP.BH72            195       a, 
OCHROBACTRUM ANTH.          196       AI 
BORDETELLA PERTUS.          198       L 
HAEMOPHILUS INFLU.          200       AAAACDDDEFGOGHIIIKKKKLLLLLMMMNNPQQRRRRSSSSTTVVWY 
ANACYSTIS NIDULANS          210       AI 
SYNECHOCYSTIS SP.           214       AACDFGOGniHKKLLLLLNPPPQRRRKSSSST1TVVWWXXY 
SYNECHOCOCCUS SP.           215       L 
CYANOPHORA PARAD.           218       AEGH1LRS 
PYLAIELLA LITTORA           222       AI 
STREPTOCOCCUS FN.           224       A 
STREPTOCOCCUS SAL.          225       A 

ORGANELLES 

CHLOROPLASTS                240-359 
CYANOPHORA PARAD.           240       AI 
PYLAIELLA LITTORA.          241       AI 
CHLAMYDOMONAS REIN          244       ACDEGIMRRTW 
CHLAMYDOMO. MOEWU.          246       T 
CHLORELLA ELLIPSO.          248       AIRS 
LYCOPERSICON ESCU.          249       DLY 
CUCUMIS SATIVUS             250       E 
ASTASIA LONGA               251       ACDGIKLMPQRSSTV 
EUGLENA GRACILIS            252       AACDFJGGHnKLLLMNFXJRRSSTVWXXY 
CRYPTOMONAS SPEC.           254       AIR 
SPIROGYRA MAXIMA            255       I 
ANTITHAMNION SP.            257       AI 
CYANIDIUM CALDAR.           258       AIK 
OLISTHODISCUS LUT.          259       AI 
MARCHANTIA POLYM.           260       ACDEFGGHIIKlXIJ«£NPPQRRRSS^rTVVWXY 
CUSCUTA REFLEXA             261       AH1LMV 
COLEOCHAETE ORBIC.          262       AI 
HORDEUM VULGARE             264       GGMSTVX 
TRITICUM AESTIVUM           268       CDEGGMPRSTWXY 
ORYZA SATIVA                270       ACCDEKXMILLLMMNPQRRSSSTTVVWY 
ZEA MAYS                    272       AACDEFGGKHDKLLLLMNPQRRSSSSSTTTVVVWXXY 
EPIFAGUS VIRGINIA           274       LNR 
ARABIDOPSIS THAL.           276       IMP 
ALLIUM PORRUM               278       R 
BRASSICA OLERACEA           280       L 
GLYCINE MAX                 284       AIMV 
MEDICAGO SATIVA             288       H 
NICOTIANA TABACUM           292       ACDEFGGlBnKLLLMNPQRRSSSTTVVWXY 
NICOTIANA DEBNEYI           296       H 
OENOTHERA SP.               300       PW 
DAUCUS CAROTA               301       V 
GOSSYPIUM HIRSUTUM          302       H 
PELARGONIUM ZONALE          304       R 
PENNISETUM AMERICA          308       I 
PETUNIA HYBRIDA             312       H 
PHASEOLUS VULGARIS          316       H 
HELIANTHUS ANNUUS           317       HNY 
PISUM SATIVUM               320       DEGHKLN PRRSTVWXY 
PINUS THUNBERGII            322       ACDEPGGHI1KL1XMNPPQRRRSSSTIVVWXY 
PINUS CONTORTA              323       HK 
SINAPIS ALBA                324       HKQSV 
SPINACIA OLERACEA           328       ACDEH IlLMRSSTTVY 
SPIRODELA OLIGORH.          332       NRR 
VICIA FABA                  336       EFHLLTY 
SORGHUM BICOLOR             340       L 

MITOCHONDRIA                360-599 

SINGLE CELL ORGANISMS AND FUNGI  360-419 
PROTOTHECA WICKER.          360       ACDBPCKIHUK1JA1NPQRRSSTVWXY 
PYLAIELLA LITTOR.           361       KPY 
CHONDRUS CRISPUS            362       ACEGGHIKLLMNPQRRSVWXY 
PLATYMONAS SUBCORD.         363       KNPVY 
CHLAMYDOMO. REINH.          364       MQW 
ODONTELLA SINENSIS          365       AACDBFGGHUIKLLNPPQRRSSTVWXY 
PLASMODIUM FALCIP.          366       CDEGGHKLPQSSWXY 
TRYPANOSOMA BRUCEI          368       AA 
LEPTOMONAS COLLO.           369       H 

Table 1. continued 

PARAMECIUM PRIM.            372       XY 
Paramecium tetra.           376       WY 
PARAMECIUM AURELIA          377       PWY 
TETRAHYMENA PYRIF.          380       BFHLWX 
TETRAHYMENA THERM.          384       LXY 
ASPERGILLUS FUMI.           387       EMMTV 
ASPERGILLUS NIDUL.          388       ACCDEFGGHIKLLMMNPQRSSTVWXY 
NEUROSPORA CRASSA           392       ACMR 
r\J IXJo rVJ KA AN SBnlrlA            umnavw 
PODOSPORA CURVICOL          397       N 
SACCHAROMYCES CER.          400       AACOEFGH1KLMNPQRRSSTTVVWXYY 
SACCHAROMYCES EXI.          401       MP 
PICHIA PIJPERI              402       LMM 
WILLIOPSIS MRAKII           403       KLMPQSV 
SCHIZOSACCHAROM.            404       GHLPQ 
KLUYVEROMYCES LAC.          405       CKLQ 
CANDIDA PARAPSILO.          406       CEFGH1KLNPRRTVWY 
HANSENULA WINGEI            407       ACCDEFGHIKUXMMMMNPQRRRSSTVVWY 
TORULOPSIS GLAB.            408       ACDEFGHlKlJrfNPQRSSTTVWXY 
WILLIOPSIS SUAVE            409       M 
PICHIA JADINII              410       M 
TRICHOPHYTON MENT.          409       APLMMTV 
TRICHOPHYTON RUBR.          412       DOIKMQRWY 
PENICILLIUM CHRYS.          413       NRY 
ASCOBOLUS IMMERSUS          415       NNQ 

PLANTS                      420-459 
ARABIDOPSIS THAL.           424       EMQSSY 
GLYCINE MAX                 428       EMX 
SOLANUM LYCOPERS.           430       C 
SOLANUM TUBEROSUM           431       X 
LUPINUS LUTEUS              432       CHNX 
BRASSICA NAPUS              434       K 
OENOTHERA SP.               436       CFGHILNPSSSWXY 
PHASEOLUS VULGARIS          440       NSY 
HELIANTHUS ANNUUS           441       CEGHIKMNPQVWX 
TRITICUM AESTIVUM           444       CDEFKNPQQSSSWXYYY 
ORYZA SATIVA                446       FHNPRSW 
ZEA MAYS                    448       CDEHKMMPSSWXY 
MARCHANTIA POLYM.           450       ACDEFOOHIKLLlJtfMNPQRRRSSTVWY 
LARIX                       452       HH 

ANIMALS                     460-599 
FASCIOLA HEPATICA           462       AD1KNPSW 
ASCARIS SUUM                464       ACDEFQHIKLLNPQRSSTVWXY 
CAENORHABDI. ELEG.          468       ACDBFGHIKLLNPQRSSTVWXY 
MYTILUS EDULIS              470       ACDBPGHIKLLMMNPQRSSTVWY 
ARTEMIA SP.                 472 
LOCUSTA MIGRATORIA          476       ACDDEFGGBIKIG-LLLNPPQRSSTTVWXY 
PSEUDOREGMA BAMBU.          477       L 
METRIDIUM SENILE            478       X 
NEPHILA CLAVIPES            479       AAAA 
AEDES ALBOPICTUS            480       AEFGLNRSV 
LOLIGO BLEEKERI             481       KKKKK 
APIS MELLIFERA              482       ACDDEFGHIKLLMPQRSSVWY 
DAPHNIA PULEX               483       IQVWXY 
DROSOPHILA MELANO.          484       ACCDDEFCK3H1KKIXLQRSSTVWWXYY 
DROSOPHILA YAKUBA           488       ACDEFGHIKLLNPQRSSTVWXY 
DROSOPHILA VIRILIS          496       IQX 
CHORISTONEURA FUM.          497       L 
PISASTER OCHRACEUS          498       ACDEGLLNPQTVWXY 
PROTOPTERUS DOLLOI          499       ACDEFGHIKLLMNPQRSTVWy 
ASTERINA PECTINI.           500       ACDGHLLMNPQSS VW Y 
CERATITIS CAPITATA          501       AEFNRS 
ASTERIAS FORBESII           502       ACDGLLNVWXY 
CYPRINUS CARPIO             503       ACDEFGHI ICKU.NPQRSSTVWXY 
PARACENTROTUS LIV.          504       ACDEFFGHIKLLNPQRSSEVWXY 
ANOPHELES QUADRIM.          505       ACDEFGHI KLLNPQRSSTVWXY 
RAINBOW TROUT               506       FPT 
ANAS PLATYRHYNCOS           507       ACDEFKLNSWY 
STRONGYLOCEN. PURP.         508       ACDEFQHIKLLNPQRSSTVWXY 
ACIPENSER transm.           509       PT 
gadus morhua                510       ACDEFGHIRXLMNQRSSTVWY 
acanthamoeba cast.          511       ADE1KLMPQX 
XENOPUS LAEVIS              512       ACDEFFGHIKLLNPQRSSTVWXY 
ALLIGATOR missis.           513       ACNWY 
CROCODYLUS POROSUS          514       ACNWY 
CARETTA CARETTA             515       ACNWY 
RANA CATESBEIANA            516       ACFILNPQTWXY 
MALACLEMYS TERRA.           517       ACNWY 
SPHENODON PUNCTAT.          518       ACNY 
EPICRATES SUBFLA            519       ACKN 
CEPHALORHYN.COM.            520       FPT 
CROSSOSTOMA LACUS.          521       ACDEFGHIKLLNPQRSSTVWXY 
CHICKEN                     527       ACDEFGHIKLLMNPQRSSTVWY 
DIDELPHIS VIRGINI.          523       DPT 
ODOCOILEUS HEMIC            524       PT 
DICEROS BICORNIS            525       PP 
MARMOSA SP.                 526       DPT 
PHILANDER SP.               527       D 
RAT                         528       ACO)DEFGHimi>r>WPPQQR5SSTrVWWXXY 
METACHIRUS SP.              529       D 
PHALANGER SP.               530       D 
CNEMIDOPHORUS UNI.          531       EFT 
MOUSE                       532       ACDEFGHIKLLNPQRSSTVWXY 
CERVUS NIPPON               533       PT 
BALAENOPTERA PHYS.          534       ACDEFGHIKLLPQRTVWXY 
BALAENOPTERA MUSC           535       ACDEFGHIKLLNPQRSTVWXY 
BOVINE                      536       ACDEFGHIKLLNPQRSSTVWXY 
HALICHOERUS GRYPUS          537 


ACDEFGLNPQRRSTVWXY 


PHOCA VTTULINA 


538 


ACDEFGHIKLLNPQRSTVWXY 


GADUS MORHUA 


539 


DEIPQST 


LEPIDOSIREN PARAD. 


542 


V 


RHINOCEROS UNICORN 


544 


ACDEFGHIKLLMNPQRTVWY 


SCELOPQRUS OCCID. 


545 


HILLMMVW 


STRUTHIO CAMELUS 


■ 550 


HILMRW 


EPJNACEUS EURO P. 


5S5 


ACDEFGH1KLMNPQRTVY 


MACACA ASSAM ENSES 


556 


HLS 


MACACA NIGRA 


557 


HL 


MACACA SJLENUS 


558 


HL 


MACACA THIBETAN A 


559 


HL 


GREEN MONKEY 


560 


P 


SLAMANO 


561 


ACDEIKNWXY 


MACACA FUSCATA 


562 


HLS 


MACACA MULATTA 


563 


HLS 


MACACA FASCICULA 


564 


HLS 


MACACA SYLVANUS 


565 


HLS 


SAIMIRI SCIUREUS 


566 


HLS 


PAPIO HAMADRYAS 


567 


HL 


TARS1US SYRICHTA 


568 


HLS 


LEMUR CATTA 


570 


HLS 


CHIMPANZEE 


572 


ACDEEFGH 1 KLLMN PQRSTVWXY 


PYGMY CHIMPANZEE 


573 


ACDEIKNWXY 


GIBBON 


576 


HLS 


GORILLA 


580 


ACDEEGH11KLMNPQRSTVWXY 


ORANGUTAN 


584 


ACDEEPGHIKLLLMNPRSSTVWXY 


HUMAN 


588 


AACCDEEFGHIKLLLMMNPQRRSSTVWX\ 


AEPYCEROS MELAMPUS 


590 


FV 


BOSELAPHUS TRAGOC. 


591 


FV 


CEPHALOPHUS MAXW. 


592 


FV 


DAMAUSCUS DORCAS 


593 


FV 


GAZELLA THOMSON! 


594 


FV 


KOBUSELLIPSIPRYM. 


595 


FV 


MADOQUA KIRKI 


596 


FV 


ORYX GAZELLA 


597 


FV 


TRAGELAPKUS IMBER 


598 


FV 



EUKARYOTIC CYTOPLASM      600-999

SINGLE CELL ORGANISMS     600-669
AND FUNGI
PLASMODIUM FALCI.         603       AILMNRRTV
TRYPANOSOMA BRUCEI        605       KKKNNQQRRRTWY
TETRAHYMENA PYRIF.        606       NQS
LEISHMANIA TARENT.        609       GIKLQRRTVW
DICTYOSTELIUM DIS.        616       AEEHKKLMNQRRSSSrTWWWY
PHYSARUM POLYCEPH.        618       X
NEUROSPORA CRASSA         620       FLR
CANDIDA ALBICANS          621       LSS
PHYTOPHTHORA PAR.         622       D
PODOSPORA ANSERINA        624       SS
SACCHAROMYCES CER.        628       AAACDDEEEFFFFGGHIIKKKLLLMMNP1
                                    QQQRRRSSSSSTTTVVVWWWXXYY
SCHIZOSACCHA. POM.        632       ADEEFHIKRRSSSVXX
CANDIDA CYLINDRA.         637       S

PLANTS                    670-749
CHLAMYDIA TRACHOM.        672       TW
ARABIDOPSIS THAL.         674       APSSSSSSVWWXYYYY
GLYCINE MAX               690       DMX
PHASEOLUS VULGARIS        698       LPP
NICOTIANA RUSTICA         706       SSSSSSSYY
PETUNIA SP.               710       N
HELIANTHUS ANNUUS         717       L
SORGHUM BICOLOR           714       G
ORYZA SATIVA              718       0
TRITICUM AESTIVUM         720       YYYYY
TRITICUM VULGARE          724       S
SOYBEAN                   730       C

ANIMALS                   750-999
CAENORHABDI. ELEG.        756       AAADEEOGHIKKKLLLLLLNPPP
                                    QQRRRRRSSTTVWWXYZ
BOMBYX MORI               768       AAEGK
DROSOPHILA MELANO.        774       ADEEEFGGHIKKLLMNPRRSSTVVXYZ
DROSOPHILA SIMUL.         780       S
SQUID                     785       K
XENOPUS LAEVIS            792       AFKLNVXXYYYZ
PODOCORYNE CARNEA         793       CFGSS
CHICKEN                   804       AADDFKPPWZ
MOUSE                     810       ACCDEGHKKLPPXZ
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Table 1. continued 



RAT 


916 


DDKRKEEKHEEPOGKLLLPPQQQQQQ 


MARSUPIAL 


599 


D 


BOVINE 


518 










HUMAN 


999 


AEEGGGKKLLMNNPPQQQRR 


EUKARYOTIC CYTOPLASM


600-999 








SSSS3TTTVVVVVVVXXYY 














SINGLE CELL ORGANISMS 


600-669 










AND FUNGI 






PART TWO: tRNA sequences






EUGLENA GRACILIS 


604 


DP 








TETRAHYMENA THERM.


608 


OjQQX 


Source 


Code 


tRNA 


SCENEDESMUS OBLIQ.


612 


FXY 








NEUROSPORA CRASSA


620 


PX 


VIRUSES 


000-029 




SACCHAROMYCES CER.


628 










ACDEFFGGHHlRTOLlAiNPPRRRSSSTrVVVWXY 


AVIAN ONCO.-VIRUS


010 


M 


SCHIZOSACCHA. POM.


632 


EPXY 


CHICKEN ASV/AMV/RS 


014 


W 


TORULOPSIS UTILIS


636 


AILPVXY 


MOUSE M-MULV


018 


PP 


CANDIDA CYLINDRA.


637


LLLSSSSS


PHAGE T4 


022 


GILPQRST 








PHAGE T5


026 


DLNPQ 


PLANTS 


670-749 




ARCHAEBACTERIA


030-109 




uno rvCT tw \n it r\ a t>t? 
n'Wl ffcVM VIJUuUAjld 


678 


BEF 








WHEAT GERM 


682 


PGKMRWXYY 


HALOBACTERIUM CUT. 


038 


AOHNQRSTWVX 


BRASSICA NAPUS 


686 


F 


HALOFERAX VOLCANII


050 


AAACDEEFGGGGHIIKRLLLLLMNPPP 


LUPINUS LUTEUS


694 


EFGHIMNPSVXY 






QRRRSSSTTVVWXY 


PHASEOLUS VULGARIS 


698 


LLLLX 


HALOCOCCUS MORRHUA 


034 


X 


PISUM SATIVUM 


702 


P 


METHANOBAC. THERM.


062 


ON 


SPINACIA OLERACEA


704 


S 


SULFOLOBUS ACIDO.


082 


X 


NICOTIANA RUSTICA


706 


SSSSSYY 


THERMOPLASMA ACIDO.


090 


MX 


SOLANUM TUBEROSUM 


707 


LW 








CUCUMIS SATIVUS 


708 


L 


EUBACTERIA 


110-239














ANIMALS 


750-999 


MYCOPLASMA CAPRIC.


114 


ACDEFOmiKKLLLMNPQR&S STTVWWXY 








MYCOPLASMA MYCOID. 


118 


AG3PSTVX 


CAENORHABDI. ELEG.


756 


L 


SPIROPLASMA CITRI


123 


WW 


ASTERINA AMURENSIS


762 


X 


STREPTOMYCES GRIS.


130 


X 


BOMBYX MORI


768 


AAFPGGI 


STREPTOMYCES COEL.


131 


G 


DROSOPHILA MELANO. 


774 


BFHKKSSSVWXY 


STAPHYLOCOC. EPID.


138 


GO 


EUPHAUSIA SUPERBA


786 


X 


MYCOBAC. SMEG.


142 


X 


XENOPUS LAEVIS


792 


DFX 


BACILLUS STEARO. 


146 


FLVY 


SALMON LIVER


798 


X 


BACILLUS SUBTILIS 


154 


AFG3KKLMPRSSSTVWXYY 


CHICKEN 


804 


W


THERMUS THERMOPHI.


158 


DPIMXX 


MOUSE 


810 


BFFF1KKMQQRRVXZ 


E.COLI


166 


AAACDBEEFGGGHIIIKLLLMNQQ 


RAT 


916 


DDEKKKLLNNQSSSVVX 






RRRRRSSSSSTTVVVWXXYYZ 


RABBIT LIVER 


922 


DFKKKMV 


SALMONELLA TYPHI. 


170 


OOHLPPP 


BOVINE LIVER 


928 


DFFLLNQRRRTWYZ 


AZOSPIRILLUM LIPO.


172 


N 


CALF LIVER 


934 


F 


RHODOSPIRIL. RUB.


202 


FL 


COW MAMMARY GLAND 


940 


LL 


AGMENELLUM QUADR.


206 


P 


SHEEP LIVER


946 


HX 


ANACYSTIS NIDULANS


210 


LLX 


HUMAN 


999 


AAEFGGHLMNNQQSVXYYZ 


SYNECHOCYSTIS SP. 


214 


B 








ORGANELLES 












CHLOROPLASTS 


240-359 


PART THREE: tRNA and tRNA gene sequences that differ from the com


CHLAMYDOMONAS REIN 


244 


E 


Source 


Code 


tRNA/tRNA gene


EUGLENA GRACILIS 


252 


P 








CODIUM FRAGILE


253 


GKMR 


ARCHAEBACTERIA


030-109 


SCENEDESMUS OBLIQ. 


256 


MXY 








LUPINUS ALBUS 


263 


Y 


METHANOCOCCUS JAN. 


065 


Z 


HORDEUM VULGARE


264 


DDEQ 








TRITICUM AESTIVUM


268 


B 








ZEA MAYS


272 


I 


MITOCHONDRIA 


360-599 


GLYCINE MAX 


284 


LLL 








NICOTIANA TABACUM


292 


W 


SINGLE CELL ORGANISMS 


360-419 


PHASEOLUS VULGARIS 


316 


FLLLWX 


AND FUNGI 






SPINACIA OLERACEA


328 


PTILMPTVWX 














PHYTOMONAS SP.


367 


Q 








TRICHOPHYTON MENT. 


409 


B 


MITOCHONDRIA 


360-599 














ANIMALS 


460-599 


SINGLE CELL ORGANISMS 


360-419 








AND FUNGI 






LOCUSTA MIGRATORIA 


476 


S 








APIS MELLIFERA


482


T 


TETRAHYMENA PYRIF.


380 


PY 


DAPHNIA PULEX


483 


C 


TETRAHYMENA THERM. 


384 


W 


DROSOPHILA MELANO. 


484 


P 


NEUROSPORA CRASSA


392 


ALLTVWXY 


PROTOPTERUS DOLLOI 


499 


S 


SACCHAROMYCES CER.


400 


PGHIKLMPRRSSSTWXY 


BALAENOPTERA PHYS. 


534 


NSS 








BALAENOPTERA MUSC.


535 


S 


PLANTS 


420-459 


HALICHOERUS GRYPUS


537 


K 








PHOCA VITULINA


538 


S 


SOLANUM TUBEROSUM 


431 


IIL 


SIAMANG 


542 


S 


OENOTHERA SP. 


436 


P 


RHINOCEROS UNICORN 


544 


SS 


PHASEOLUS VULGARIS 


440 


PLLLLMPWXY 


SCELOPORUS OCCID.


545 


ACS 








STRUTHIO CAMELUS 


550 


AST 


ANIMALS 


460-599 


ERINACEUS EUROP.


555 


SS 








MACACA THIBETANA


559 


S 


ASCARIS SUUM


464 


PMS 


PAPIO HAMADRYAS


567 


S 


AEDES ALBOPICTUS 


480 


DBG3KQRSVX 


CHIMPANZEE 


572 


S 


LOLIGO BLEEKERI


481


KXK 


PYGMY CHIMPANZEE 


573 


S 


HAMSTER 


524 


DKRS 


GORILLA 


580 


S 


RAT LIVER 


528 


DDFKLLLRWW 


ORANGUTAN 


584 


S 


BOVINE LIVER


536 


BGOOXRSSSTVWXX 


HUMAN 


588 


N 


HUMAN 


588 


8 
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Table 2, Format of tRNA sequences in the databank 



PART ONE: SEQUENCES OF tRNA -GENES 



Anticodon 
Organism 



accept D-domain anticodon domain variable region T-domain accep

stem extra loop stem

01234567891111111 11 11222222222222333333333344 4 444eeoeeeeeeeeeeeee<!ee4444SSS555555566666666667777777 
012345677890001234S67890123456789012345111111H2345222222267B901234567B901234567890123456 



Kingdom 



DA0260 TGC PHAGE T5            VIRUS  -GGGCGAATAGTGTCAGC-GGG-
DA0340 TGC ARCHAEOGLOBUS FULG. ARCHAE -GGGCTCGTAGCTCAGC--GGG-
DA0380 TGC HALOBACTERIUM CUT.  ARCHAE -GGGCCCATAGCTCAGT--GGT-
DA0420 TGC HALOBACTERIUM HAL.  ARCHAE -GGGCCCATAGCTCAGT--GGT-
DA0500 TGC METHANOBAC. FORMI.  ARCHAE -GGGCCCGTAGCTCAGACTGGG-
DA0620 TGC METHANOBAC. THERM.  ARCHAE -GGGCCCGTAGCTCAGACTGGG-
DA0650 TGC METHANOCOCCUS JAN.  ARCHAE -GGGCCCGTAGCTCAGCT-GGG-
DA0651 GGC METHANOCOCCUS JAN.  ARCHAE -GGGCTGGTAGCTCAGACTGGG-
DA0660 TGC METHANOCOC. VANI.   ARCHAE -GGGCCCGTAGCTCAGTT-GGG-
DA0670 TGC METHANOTHRIX SOEH.  ARCHAE -GGGCTTGTAGCTCAGCT-GGT-
DA0680 TGC METHANOTHERM. FER.  ARCHAE -GGGCCCATAGCTCAGCCTGGG-
DA0780 TGC METHANOSPIR. HUNG.  ARCHAE -GGGCTCGTAGCTCAGCT-GGA-
DA0940 TGC THERMOCOCCUS CELER  ARCHAE -GGGCCGGTAGCTCAGCCTGGG-
DA0980 TGC THERMOPROT. TENAX   ARCHAE -GGGCCGGTAGTCTAGC--GGA-
DA0981 CGC THERMOPROT. TENAX   ARCHAE -GGGCCGGTAGTCTAGC--GGA-
DA1U0  TGC BARTONELLA ELIZAB.  EUBACT -GGGGCCGTAGCTCAGCT-GGG-
DA1130 TGC BARTONELLA QUINT.   EUBACT -GGGGCCGTAGCTCAGCT-GGG-



■AGCACACCAGACTTGCAATCTGGTA G-GGAGGGTTCGAGTCCCTCTTTGTCCACCA 

•AGAGCGCCXjCCITTGCGAGGCGGAG- 

•AGAGTGCCTCCTTTGCAAGGAGGAT GCCCTGGGTTCGAATCCCAGTGGGTCCA 

• AGAGTGCCT CCTTTGCAAGGAGGAT GCCCTCGGTTGGAATCCCAGTGGGTCCA- ~ 

'AGAGCGCCGCCCTTGCAAGGCGGAG 

-AGAGCGCCGGCCTTGCAAGCCGGAG GCCGTGGGTTCAAATCCCACCGGGTCCA 

- AGAGCGCCGCAT TGGCTGTGCGGAG GCCGCGGGTTCAAATCCCGCCCAGTCCA- - - 

GCCGTGGGTTCAAATCCCGCCGGGTCCA- - - 

-AGAGCGCGGCGTTTGCAACGCCGAG . GCCTGGGGTTCAMTCCCCACGGGTCCA 

•AGGACGCCCGCCTCGCGCGCGGGAG ATCCCGGGTTCGAATCCCGGCCGGTCCA- • 

"AGAGCACCTGCTTTGCAAGCAGGGG 

■AGAGCACCTGCrrTGCAAGCAGGGG GTCGTCGGTTCGATCCCGTCCGGCTCCACCA 





(Continued in the databank. See text for instructions.) 



RESULTS 

Presentation of sequences 

The sequences in the database are divided into three parts. The first 
two parts contain the sequences of the tRNA genes and tRNAs, 
respectively, which can be fitted into the canonical tRNA alignment. 
The third part lists tRNA and tRNA gene sequences, mainly of 
animal mitochondria, whose secondary structures differ from most 
tRNAs and could not be aligned according to Figure 1.

An example of sequence presentation in the database is given
in Table 2. Each sequence in the compilation occupies two
consecutive lines. The first line begins with the letter 'D' or 'R'
and contains the six-position identification code of the sequence
('D' or 'R' for DNA or RNA, respectively; a one-letter code for
the amino acid, X for methionine-initiator, Z for selenocysteine;
and the four-digit code specifying the organism and isoacceptor).
After this, the sequence of the anticodon (in the case of tRNA
sequences in its modified form) is given, followed by the name
and the kingdom of the organism (Table 1), and the sequence
(99 standard positions). The second line begins with the sign '+'
and contains the information about base-pairing (double helical
regions only; tertiary interactions are not annotated). All other
lines in the compilation begin with signs other than 'D', 'R' or '+'
(usually '*') and contain comments.
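
The line layout described above can be sketched in a few lines of Python. This is only an illustration, not part of the databank software: the names `parse_id`, `line_kind` and `SequenceId` are hypothetical, and the mapping of the first three digits of the four-digit code onto the code ranges of Table 1 is an assumption inferred from entries such as DA0260 (PHAGE T5, code 026).

```python
import re
from typing import NamedTuple

# Code ranges of the source groups as listed in Table 1.
CODE_RANGES = [
    (0, 29, "viruses"),
    (30, 109, "archaebacteria"),
    (110, 239, "eubacteria"),
    (240, 359, "chloroplasts"),
    (360, 599, "mitochondria"),
    (600, 999, "eukaryotic cytoplasm"),
]


class SequenceId(NamedTuple):
    molecule: str    # "DNA" or "RNA"
    amino_acid: str  # one-letter code; X = initiator Met, Z = selenocysteine
    number: str      # four digits: organism and isoacceptor
    group: str       # Table 1 group (assumed to follow from the first three digits)


def parse_id(code: str) -> SequenceId:
    """Split a six-position identification code such as 'DA0260'."""
    match = re.fullmatch(r"([DR])([A-Z])(\d{4})", code)
    if match is None:
        raise ValueError(f"not a six-position identification code: {code!r}")
    kind, amino_acid, number = match.groups()
    organism = int(number[:3])  # assumption: first three digits = Table 1 code
    group = next(name for lo, hi, name in CODE_RANGES if lo <= organism <= hi)
    return SequenceId("DNA" if kind == "D" else "RNA", amino_acid, number, group)


def line_kind(line: str) -> str:
    """Classify a databank line by its first character."""
    first = line[:1]
    if first in ("D", "R"):
        return "sequence"
    if first == "+":
        return "pairing"
    return "comment"
```

For example, `parse_id("DA0650")` yields a DNA entry for amino acid A in the archaebacterial range, which agrees with the METHANOCOCCUS JAN. line of Table 2.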

Nucleotides involved in Watson-Crick pairs are marked with
'='; GU pairs are indicated with the sign '*'. Nucleotides
26 and 44 are considered to form a base-pair included in the
anticodon stem (Fig. 1).
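
As a minimal sketch of this marking convention, the following Python functions assign '=' to Watson-Crick pairs and '*' to GU pairs. The function names are hypothetical, T is treated like U so that gene sequences can be handled (an assumption), and the symbols for modified nucleotides are not covered.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"),
                ("A", "T"), ("T", "A")}
WOBBLE = {("G", "U"), ("U", "G"), ("G", "T"), ("T", "G")}


def pair_mark(a: str, b: str) -> str:
    """Return '=' for a Watson-Crick pair, '*' for a GU pair, ' ' otherwise."""
    pair = (a.upper(), b.upper())
    if pair in WATSON_CRICK:
        return "="
    if pair in WOBBLE:
        return "*"
    return " "


def mark_stem(five_prime: str, three_prime: str) -> str:
    """Mark the pairs of a stem; both strands are written 5' to 3'.

    The first nucleotide of the 5' strand pairs with the last
    nucleotide of the 3' strand, and so on.
    """
    if len(five_prime) != len(three_prime):
        raise ValueError("both strands of a stem must have the same length")
    return "".join(pair_mark(a, b)
                   for a, b in zip(five_prime, reversed(three_prime)))
```

With the made-up strands `GGGU` and `GCCC`, `mark_stem` gives three Watson-Crick marks followed by one GU mark.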



The sequences denoted as 'yeast' in original publications are
assigned to Saccharomyces cerevisiae. The user should be aware,
however, that some of these organisms have possibly been
misclassified and that the original literature should be consulted.

This compilation uses a one-letter code for all nucleotides
including modified ones. For the standard nucleotides adenosine,
cytidine, guanosine, thymidine and uridine, the usual abbreviations
A, C, G, T and U, respectively, are used. To designate modified
nucleotides, the other ASCII signs are employed as defined in Table
3. Terminology and structure of the modified nucleosides occurring
in tRNAs were used according to refs 2 and 3. Positions in a particular
sequence which are not filled (gaps in the generalised structure,
Fig. 1) are indicated by a dash. All nucleotide insertions are denoted
by underlining at the place of insertion.
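
Because unfilled positions are written as dashes, the unaligned sequence can be recovered by dropping them. The helper names below are hypothetical, and the example fragment is the beginning of the PHAGE T5 entry in Table 2; insertions marked by underlining are not handled in this sketch.

```python
def ungap(aligned: str) -> str:
    """Remove the dashes that mark unfilled alignment positions."""
    return aligned.replace("-", "")


def filled_positions(aligned: str) -> int:
    """Count the alignment positions that carry a nucleotide."""
    return sum(1 for symbol in aligned if symbol != "-")


fragment = "-GGGCGAATAGTGTCAGC-GGG-"  # start of entry DA0260 in Table 2
print(ungap(fragment))
print(filled_positions(fragment), "of", len(fragment), "positions filled")
```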

Numbering and alignment of the variable region 

The alignment of the variable region has been done in accordance
with Steinberg and Kisselev (4). The extra arm is placed between
nucleotides 45 and 46. It includes two double helical strands forming
a stem and a loop. The annotations of the nucleotides in the extra arm
positions begin with the letter 'e' (extra) followed by a one- or
two-digit number. We have reserved a space for 7 bp in the stem and
5 nt in the loop. The nucleotides in the loop are numbered from 1
to 5, whereas the nucleotides in the stem are numbered from 11 to
17 (5'-branch) and from 27 to 21, in the reverse order (3'-branch),
to indicate base-pair formation between nucleotides 11-21, 12-22,
etc. (Fig. 1). In the tRNAs where the extra arm position 45 is empty
but where the nucleotides 46-48 between the extra arm and
T-domain are present, the positions will be filled in the order 48, 46,


Nucleic Acids Research. 1998, Vol 26, No. 1 153 



Table 3. Modified nucleosides in tRNA and their abbreviations 



One-letter code of nucleotide

Symbol [2,3] 

. V 



U


U


C


C 


A 


A 


G 


G 


T 


T 


(underline) 


H 


?A 




m1A


/ 


m2A 


+ 


i6A 


* 


ms2i6A




m6A 


6 


t6A 


E 


m6t6A


I 


ms2t6A 




Am 


I 


I 


0 


m1I


A


Ar(p)




io6A 


< 


?C 


% 


s2C 


B 


Cm 


M 


ac4C 


? 


m5C 




m3C 


} 


k2C 


> 


f5C 


o 


f5Cm



Name [2,3] 
V 

uridine
cytidine
adenosine
guanosine
thymine (for sequences of tRNA genes only)
empty position
insertion (see footnote for further information)
unknown nucleotide
unknown modified adenosine
1-methyladenosine
2-methyladenosine
N6-isopentenyladenosine
2-methylthio-N6-isopentenyladenosine
N6-methyladenosine
N6-threonylcarbamoyladenosine
N6-methyl-N6-threonylcarbamoyladenosine
2-methylthio-N6-threonylcarbamoyladenosine
2'-O-methyladenosine
inosine
1-methylinosine
2'-O-(5-phospho)ribosyladenosine
N6-(cis-hydroxyisopentenyl)adenosine
unknown modified cytidine
2-thiocytidine
2'-O-methylcytidine
N4-acetylcytidine
5-methylcytidine
3-methylcytidine
lysidine
5-formylcytidine
2'-O-methyl-5-formylcytidine





?G


s 


Gr(p)


K


m1G


L 


m2G 


0 


Gm 


R 


m22G 


I 




7 


m7G 


( 


fa7d7G


Q 


Q 


8 


manQ 


9 


galQ 


Y 


yW 


W 


o2yW 


N 


?U


{


mnm5U


2 


s2U 


J 


Um 


4 


s4U 


& 


ncm5U 


1 


mcm5U 


S 


mnm5s2U 


3 


mcm5s2U


V 


cmo5U 


5 


mo5U 


1 


cmnm5U


$


cmnm5s2U


X 


acp3U 




mchm5U


)


cmnm5Um




ncm5Um


D 


D 


P 




] 




Z 


Tm 


T 


m5U


F


m5s2U


\


m5Um



unknown modified guanosine
2'-O-(5-phospho)ribosylguanosine
1-methylguanosine
N2-methylguanosine
2'-O-methylguanosine
N2,N2-dimethylguanosine
N2,N2,2'-O-trimethylguanosine
7-methylguanosine
archaeosine
queuosine
mannosyl-queuosine
galactosyl-queuosine
wybutosine
peroxywybutosine
unknown modified uridine
5-methylaminomethyluridine
2-thiouridine
2'-O-methyluridine
4-thiouridine
5-carbamoylmethyluridine
5-methoxycarbonylmethyluridine
5-methylaminomethyl-2-thiouridine
5-methoxycarbonylmethyl-2-thiouridine
uridine 5-oxyacetic acid
5-methoxyuridine
5-carboxymethylaminomethyluridine
5-carboxymethylaminomethyl-2-thiouridine
3-(3-amino-3-carboxypropyl)uridine
5-(carboxyhydroxymethyl)uridine methyl ester
5-carboxymethylaminomethyl-2'-O-methyluridine
5-carbamoylmethyl-2'-O-methyluridine
dihydrouridine
pseudouridine
1-methylpseudouridine
2'-O-methylpseudouridine
ribosylthymine
5-methyl-2-thiouridine
5,2'-O-dimethyluridine



47, i.e., tRNAs use positions 48, 46 and 47 for the first, second and
third nucleotide, respectively, depending on the length of the
sequence in this region. A similar situation occurs in tRNAs without
a long extra arm, where the most variable position 47 is deleted in
many sequences.
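
The numbering of the variable region described in this section can be illustrated with a short Python sketch. The function names are hypothetical, and the labels are written as the letter 'e' followed by the position number, as in the header of Table 2. The last function models only which of the positions 46-48 are occupied for a given number of nucleotides; how the nucleotides are ordered within those positions is not addressed here.

```python
def extra_arm_labels() -> list[str]:
    """Labels of the reserved extra-arm positions in 5' to 3' order."""
    five_prime_branch = [f"e{n}" for n in range(11, 18)]       # e11 .. e17
    loop = [f"e{n}" for n in range(1, 6)]                      # e1 .. e5
    three_prime_branch = [f"e{n}" for n in range(27, 20, -1)]  # e27 .. e21
    return five_prime_branch + loop + three_prime_branch


def extra_arm_pairs() -> list[tuple[str, str]]:
    """Base pairs of the extra-arm stem: 11-21, 12-22, ..., 17-27."""
    return [(f"e{n}", f"e{n + 10}") for n in range(11, 18)]


def occupied_positions(count: int) -> list[int]:
    """Positions among 46-48 that are used when `count` nucleotides are present.

    Positions are filled in the order 48, 46, 47.
    """
    if not 0 <= count <= 3:
        raise ValueError("the region 46-48 holds between 0 and 3 nucleotides")
    return sorted([48, 46, 47][:count])
```

For instance, `occupied_positions(2)` returns positions 46 and 48, so position 47 is the one left empty.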



Alignment of animal mitochondrial tRNAs 

In properly aligned tRNA sequences, nucleotides occupying the 
same position in different tRNA sequences should play a 
comparable structural or functional role. Most animal mitochondrial 
tRNAs cannot be easily aligned with other tRNAs mainly because 
of the absence of information on their three-dimensional structure. 
Experimental data, however, point to the existence of tertiary 
interactions in these tRNAs. In this compilation, we use an 
alignment which accounts for these interactions as much as possible. 
Where we could do so, the animal mitochondrial tRNAs were
included in Parts I and II. The alignment of animal mitochondrial
tRNA is, however, not yet unambiguous.

Some animal mitochondrial tRNAs have a completely unusual
secondary structure and cannot be fitted into the tRNA alignment used
here (Parts I and II). We treated these sequences separately and included
them in Part III. Here, each particular sequence has its own



alignment. To this group belong the tRNAs from: (i) mitochondria
of a parasitic worm lacking the T- or D-domain, (ii) mitochondria of
mollusks, insects and echinoderms, with extended anticodon and
T-stems and (iii) mammalian mitochondria, lacking the D-domain.

For some tRNA genes the secondary structure pattern cannot be
clearly established. We have also included these sequences in Part
III. It is possible that posttranscriptional modifications of these
tRNAs will result in improvement of the secondary structure.
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